Accent Recognition with Hybrid Phonetic Features

نویسندگان

چکیده

The performance of voice-controlled systems is usually influenced by accented speech. To make these more robust, frontend accent recognition (AR) technologies have received increased attention in recent years. As a high-level abstract feature that has profound relationship with language knowledge, AR challenging than other language-agnostic audio classification tasks. In this paper, we use an auxiliary automatic speech (ASR) task to extract language-related phonetic features. Furthermore, propose hybrid structure incorporates the embeddings both fixed acoustic model and trainable model, making robust. We conduct several experiments on AESRC dataset. results demonstrate our approach can obtain 8.02% relative improvement compared Transformer baseline, showing merits proposed method.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors

We describe a new approach to automatic dialect and accent recognition which exceeds state-of-the-art performance in three recognition tasks. This approach improves the accuracy and substantially lower the time complexity of our earlier phoneticbased kernel approach for dialect recognition. In contrast to state-of-the-art acoustic-based systems, our approach employs phone labels and segmentatio...

متن کامل

Syllable-level desynchronisation of phonetic features for speech recognition

This paper describes a novel approach to speech recognition which is based on phonetic features as basic recognition units and the delayed synchronisation of these features within a higher-level prosodic domain, viz. the syllable. The object of this approach is to avoid a rigid segmentation of the speech signal as it is usually carried out by standard segment-based recognition systems. The arch...

متن کامل

Segmentation and recognition of phonetic features in handwritten Pitman shorthand

There is a wish to be able to enter text into mobile computing devices at the speed of speech. Only handwritten shorthand schemes can achieve this data recording rate. A new, overall solution to the segmentation and recognition of phonetic features in Pitman shorthand is proposed in this paper. Approaches to the recognition of consonant outlines, vowel and diphthong symbols and shortforms, whic...

متن کامل

Spectral and temporal modulation features for phonetic recognition

Recently, the modulation spectrum has been proposed and found to be a useful source of speech information. The modulation spectrum represents longer term variations in the spectrum and thus implicitly requires features extracted from much longer speech segments compared to MFCCs and their delta terms. In this paper, a Discrete Cosine Transform (DCT) analysis of the log magnitude spectrum combin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Sensors

سال: 2021

ISSN: ['1424-8220']

DOI: https://doi.org/10.3390/s21186258